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Full text available: pdf (356.60 KB)



The use of markup languages like SGML, HTML or XML for encoding the structure of
documents or linguistic data has led to many databases where entries are adequately
described as trees. In this context querying formalisms are interesting that offer the
possibility to refer both to textual content and logical structure. We consider models where
the structure specified in a query is not only used as a filter, but also for selecting and
presenting different parts of the data. If answers are formaliz ... 

Keywords: SGML, XML, answer presentation, information retrieval, logic, query languages, 
semistructured data, structured documents, tree databases, tree matching 
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Our aim is to develop new database technologies for the approximate matching of 
unstructured string data using indexes. We explore the potential of the suffix tree data 
structure in this context. We present a new method of building suffix trees, allowing us to 
build trees in excess of RAM size, which has hitherto not been possible. We show that this 
method performs in practice as well as the O(n) method of Ukkonen [70]. Using this
method we build indexes for 200 Mb of protein and 3 ... 

Keywords: Approximate matching, Biological sequence, Database index, Suffix tree
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

MPEG-7 constitutes a promising standard for the description of multimedia content. It can 
be expected that a lot of applications based on MPEG-7 media descriptions will be set up in 
the near future. Therefore, means for the adequate management of large amounts of 
MPEG-7-compliant media descriptions are certainly desirable. Essentially, MPEG-7 media 
descriptions are XML documents following media description schemes defined with a variant 
of XML Schema. Thus, it is reasonable to investigate curren ... 

Keywords: MPEG-7, XML database systems, multimedia databases 
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Most databases contain "name constants" like course numbers, personal names, and place 
names that correspond to entities in the real world. Previous work in integration of 
heterogeneous databases has assumed that local name constants can be mapped into an 
appropriate global domain by normalization. However, in many cases, this assumption does 
not hold; determining if two name constants should be considered identical can require 
detailed knowledge of the world, the purpose of the ... 
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Renée J. Miller
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Schematic heterogeneity arises when information that is represented as data under one 
schema is represented within the schema (as metadata) in another. Schematic
heterogeneity is an important class of heterogeneity that arises frequently in integrating 
legacy data in federated or data warehousing applications. Traditional query languages and 
view mechanisms are insufficient for reconciling and translating data between schematically 
heterogeneous schemas. Higher order query languages, that ... 
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XML is rapidly emerging as a standard for exchanging business data on the World Wide 
Web. For the foreseeable future, however, most business data will continue to be stored in
relational database systems. Consequently, if XML is to fulfill its potential, some mechanism
is needed to publish relational data as XML documents. Towards that goal, one of the major 
challenges is finding a way to efficiently structure and tag data from one or more tables as 
a hierarchical XML document. Different alterna ... 

Keywords: Publishing, Relational databases, XML 
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This paper describes a methodology for the development of WWW applications and a tool
environment specifically tailored for the methodology. The methodology and the 
development environment are based upon models and techniques already used in the 
hypermedia, information systems, and software engineering fields, adapted and blended in 
an original mix. The foundation of the proposal is the conceptual design of WWW 
applications, using HDM-lite, a notation for the specification of structure, nav ... 

Keywords: HTML, WWW, application, development, Intranet, modeling 
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A new kind of data model has recently emerged in which the database is not constrained by 
a conventional schema. Systems like ACeDB, which has become very popular with 
biologists, and the recent Tsimmis proposal for data integration organize data in tree-like 
structures whose components can be used equally well to represent sets and tuples. Such 
structures allow great flexibility in data representation. What query language is
appropriate for such structures? Here we propose a simple language Un ... 
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Peter G. Anick, Rex A. Flynn
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There has been a great deal of interest within the Information Retrieval community in
evaluating the use of linguistic knowledge to improve the indexing and searching of textual
databases. Such systems must often employ a lexicon to store information about the words
and phrases comprising the application's domain. Unlike a static lexicon, a dynamic lexicon 
raises practical concerns about the coordination between the state of the lexicon and IR 
indexing sche ... 

11 Document querying and transformation: Querying XML documents by dynamic
shredding
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October 2004 Proceedings of the 2004 ACM symposium on Document engineering 

Full text available: pdf (251.39 KB) Additional Information: full citation, abstract, references, index terms

With the wide adoption of XML as a standard data representation and exchange format,
querying XML documents becomes increasingly important. However, relational database
systems constitute a much more mature technology than what is available for native
storage of XML. To bridge the gap, one way to manage XML data is to use a commercial
relational database system. In this approach, users typically first "shred" their documents
by isolating what they predict to be meaningful fragments, then store t ...
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November 1997 The VLDB Journal — The International Journal on Very Large Data
Bases, volume 6 Issue 4
Full text available: pdf (184.18 KB) Additional Information: full citation, abstract, citings, index terms

The combination of SGML and database technology makes it possible to refine both declarative and
navigational access mechanisms for structured document collections: with regard to
declarative access, the user can formulate complex information needs without knowing a 
query language, the respective document type definition (DTD) or the underlying modelling. 
Navigational access is eased by hyperlink-rendition mechanisms going beyond plain link- 
integrity checking. With our approach, the database-internal repres ... 

Keywords: Document query languages, Navigation, OODBMSs, SGML 



14 Next-Gen Open Hypermedia, Part One: An infrastructure for open latent semantic
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June 2002 Proceedings of the thirteenth ACM conference on Hypertext and 
hypermedia 
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The more the web grows, the harder it is for users to find the information they need. As a 
result, it is even more difficult to identify when documents are related. To find out that two 
or more documents are in fact related, users have to navigate through the documents and carry
out an analysis about their content. This paper presents an infrastructure allowing the use 
of latent semantic analysis and open hypermedia concepts in the automatic identification of 
relationships among web pages. Latent Sema ... 

Keywords: automatic linking, information integration, information retrieval, open
hypermedia, semantic structures, web



15 IS '97: model curriculum and guidelines for undergraduate degree programs in
information systems 

Gordon B. Davis, John T. Gorgone, J. Daniel Couger, David L. Feinstein, Herbert E.
Longenecker 

December 1996 ACM SIGMIS Database, Guidelines for undergraduate degree programs
on Model curriculum and guidelines for undergraduate degree
programs in information systems, volume 28 issue 1

Full text available: pdf (7.24 MB) Additional Information: full citation, citings



16 UnQL: a query language and algebra for semistructured data based on structural
recursion 

Peter Buneman, Mary Fernandez, Dan Suciu 

March 2000 The VLDB Journal — The International Journal on Very Large Data Bases, 

Volume 9 Issue 1 
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This paper presents structural recursion as the basis of the syntax and semantics of query 
languages for semistructured data and XML. We describe a simple and powerful query
language based on pattern matching and show that it can be expressed using structural
recursion, which is introduced as a top-down, recursive function, similar to the way XSL is 
defined on XML trees. On cyclic data, structural recursion can be defined in two equivalent 
ways: as a recursive function which evaluates the data t ... 
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The integration of distributed, heterogeneous databases, such as those available on the 
World Wide Web, poses many problems. Here we consider the problem of integrating data
from sources that lack common object identifiers. A solution to this problem is proposed for 
databases that contain informal, natural-language "names" for objects; most Web-based 
databases satisfy this requirement, since they usually present their information to the end- 
user through a veneer of text. We des ... 
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The number, size, and user population of bibliographic and full-text document databases 
are rapidly growing. With a high document arrival rate, it becomes essential for users of 
such databases to have access to the very latest documents; yet the high document arrival 
rate also makes it difficult for users to keep themselves updated. It is desirable to allow 
users to submit profiles, i.e., queries that are constantly evaluated, so that they will be 
automatically informed of new additions tha ...
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Ee-Peng Lim, Ying Lu 

July 1999 ACM Transactions on Information Systems (TOIS), volume 17 issue 3



The main purpose of a digital library is to give users easy access to an enormous amount
of globally networked information. Typically, this information includes preexisting public 
library catalog data, digitized document collections, and other databases. In this article, we 
describe the distributed query system of a digital library prototype system known as HARP. 
In the HARP project, we have designed and implemented a distributed query processor and 
its query front-end to support integr ... 
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